41 research outputs found

Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery

Author: D Mantini
E Petricoin
Henry Han
HW Ressom
J Nocedal
JS Yu
KR Coombes
M Gonen
M Hauskrecht
P Hoyer
R Lilien
R Zass
T Alexandrov
V Vapnik
X Han
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: As a novel cancer diagnostic paradigm, mass spectroscopic serum proteomic pattern diagnostics was reported to be superior to conventional serologic cancer biomarkers. However, its clinical use is not yet fully validated. An important factor preventing this young technology from becoming a mainstream cancer diagnostic paradigm is that robustly identifying cancer molecular patterns from high-dimensional protein expression data is still a challenge in machine learning and oncology research. As a well-established dimension reduction technique, PCA is widely integrated in pattern recognition analysis to discover cancer molecular patterns. However, its global feature selection mechanism prevents it from capturing local features. This may lead to difficulty in achieving high-performance proteomic pattern discovery, because only features interpreting global data behavior are used to train a learning machine. Methods: In this study, we develop a nonnegative principal component analysis algorithm and present a nonnegative principal component analysis based support vector machine algorithm with sparse coding to conduct high-performance proteomic pattern classification. Moreover, we also propose a nonnegative principal component analysis based filter-wrapper biomarker capturing algorithm for mass spectral serum profiles. Results: We demonstrate the superiority of the proposed algorithm by comparison with six peer algorithms on four benchmark datasets. Moreover, we illustrate that nonnegative principal component analysis can be effectively used to capture meaningful biomarkers. Conclusion: Our analysis suggests that nonnegative principal component analysis effectively conducts local feature selection for mass spectral profiles and contributes to improving sensitivities and specificities in the subsequent classification and to meaningful biomarker discovery.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Graph similarity through entropic manifold alignment

Author: Barrow H. G.
Cho M.
Cour T.
Duchenne O.
Edwin R. Hancock
Escolano F.
Francisco Escolano
Jiang B.
Kelsey J.
Leordeanu M.
Martins A.
Miguel A. Lozano
Nadler B.
Sanfeliu A.
von Luxburg U.
Zass R.
Zhou F.
Publication venue: Society for Industrial & Applied Mathematics (SIAM)
Publication date: 01/01/2017

In this paper we decouple the problem of measuring graph similarity into two sequential steps. The first step is the linearization of the quadratic assignment problem (QAP) in a low-dimensional space, given by the embedding trick. The second step is the evaluation of an information-theoretic distributional measure, which relies on deformable manifold alignment. The proposed measure is a normalized conditional entropy, which induces a positive definite kernel when symmetrized. We use bypass entropy estimation methods to compute an approximation of the normalized conditional entropy. Our approach, which is purely topological (i.e., it does not rely on node or edge attributes although it can potentially accommodate them as additional sources of information) is competitive with state-of-the-art graph matching algorithms as sources of correspondence-based graph similarity, but its complexity is linear instead of cubic (although the complexity of the similarity measure is quadratic). We also determine that the best embedding strategy for graph similarity is provided by commute time embedding, and we conjecture that this is related to its inversibility property, since the inverse of the embeddings obtained using our method can be used as a generative sampler of graph structure. The work of the first and third authors was supported by the projects TIN2012-32839 and TIN2015-69077-P of the Spanish Government. The work of the second author was supported by a Royal Society Wolfson Research Merit Award.

Repositorio Institucional de la Universidad de Alicante

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

White Rose Research Online

Seeking Affinity Structure: Strategies for Improving m-best Graph Matching

Author: Batra
Cho
Cour
Edwin R. Hancock
Estrada
Francisco Escolano
Fromer
Gold
Hu
Kirillov
Lawler
Leordeanu
Lyzinski
Manuel Curado
Meltzer
Miguel A. Lozano
Nilsson
Park
Ramakrishna
Rezatofighi
Schellewald
Sun
Szeliski
Vogelstein
Yanover
Zaslavskiy
Zass
Zhang
Zhou
Publication venue: Elsevier BV
Publication date: 10/09/2019

State-of-the-art methods for finding the m-best solutions to graph matching (QAP) rely on exclusion strategies. The k-th best solution is found by excluding all better ones from the search space. This provides diversity, a natural requirement for transforming a MAP problem into an m-best one. Since diversity enforces mode hopping, it is usually combined with a mode-approximation strategy such as marginalisation. However, these methods are generic insofar as they do not incorporate the detailed structure of the problem at hand, i.e. the properties of the global affinity matrix which characterise the search space. Without this knowledge, it is thus hard to devise a practical criterion for choosing the next variable to clamp. In this paper, we propose several strategies to select the next variable to clamp, spanning the whole range between depth-first and breadth-first search, and we contribute a unifying view for characterising the search space on the fly. Our strategies are: a) the number of factors in which the variables participate, b) centrality measures associated with the affinity matrix, and c) discrete pooling. Our experiments show that the maximum number of factors and centrality provide a trade-off between efficiency and accuracy, whereas discrete pooling leads to an improvement over the state of the art.

Repositorio Institucional de la Universidad de Alicante

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

White Rose Research Online

G-Protein Coupled Receptor Signaling Architecture of Mammalian Immune Cells

Author: A Broder
A Kel
AG Gilman
E Wingender
Frank Nielsen
H Kitano
H Ma
Hiroaki Kitano
J Shi
JD Jordan
K Oda
KA Janes
M Csete
M Gilchrist
M Natarajan
ML Dustin
N Polouliakh
Natalia Polouliakh
Nils Cordes
R Nock
R Zass
Richard Nock
RK Ferreira
S Geman
S Pradervand
S Roweis
SR Neves
VC Foletta
X Liu
X Zhu
Publication venue: Public Library of Science
Publication date: 14/01/2009

A series of recent studies on large-scale networks of signaling and metabolic systems revealed that a certain network structure, often called a “bow-tie network”, is observed. In signaling systems, a bow-tie network takes a form with diverse and redundant inputs and outputs connected via a small number of core molecules. While arguments have been made that such network architecture enhances robustness and evolvability of biological systems, its functional role at a cellular level remains obscure. A hypothesis was proposed that such a network functions as a stimuli-reaction classifier where dynamics of core molecules dictate downstream transcriptional activities, hence physiological responses against stimuli. In this study, we examined whether this hypothesis can be verified using experimental data from the Alliance for Cellular Signaling (AfCS), which comprehensively measured GPCR-related ligand responses for B cells and macrophages. In a GPCR signaling system, cAMP and Ca2+ act as core molecules. Stimulus responses for 32 ligands to B cells and 23 ligands to macrophages have been measured. We found that ligands with correlated changes of cAMP and Ca2+ tend to cluster closely together within the hyperspaces of both cell types and that they induced genes involved in the same cellular processes. It was found that ligands inducing cAMP synthesis activate genes involved in cell growth and proliferation; cAMP and Ca2+ molecules that increased together form a feedback loop and induce immune cells to migrate and adhere together. In contrast, ligands without a core-molecule response are scattered throughout the hyperspace and do not share clusters. G-protein coupled receptors together with immune response specific receptors were found in cAMP and Ca2+ activated clusters. Analyses were performed with original software applicable for discovering ‘bow-tie’ network architectures within the complex network of intracellular signaling, in which ab initio clustering has been implemented as well.
Groups of potential transcription factors for each specific group of genes were found to be partly conserved across B cells and macrophages. A series of findings support the hypothesis.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Sparse hashing for fast multimedia search

Author: Gao S.
He X.
Heng Tao Shen
Hong Cheng
Jain P.
Jiangtao Cui
Kulis B.
Lee H.
Liu W.
Mu Y.
Muja M.
Norouzi M. E.
Raginsky M.
Tibshirani R.
Torralba A.
Wang J.
Weiss Y.
Wu M.
Xiaofeng Zhu
Zass R.
Zi Huang
Publication venue: Association for Computing Machinery (ACM)

Crossref

Information retrieval and text mining technologies for chemistry

Author: Abacha A. B.
Alberts D.
Alfonso Valencia
American Chemical Society
Anália Lourenço
Aphinyanaphongs Y.
Appelt D. E.
Aramaki E.
Aronson A. R.
Asahara M.
Babych B.
Baeza-Yates R.
Bambenek J.
Barnard J. M.
Bast H.
Batista-Navarro R.
Batista-Navarro R. T.
Bian J.
Bies A.
Bikel D. M.
Blaschke C.
Brecher J. S.
Brill E.
Bunescu R.
Bunescu R. C.
Califf M. E.
Carpenter B.
Caruana R.
Chee B. W.
Chhieng D.
Chinchor N.
Chiticariu L.
Chowdhury M. F. M.
Ciravegna F.
Cleverdon C. W.
Coden A.
Cohen R.
Collier N.
Corbett P.
Cover T. M.
Craven M.
Cummings M. D.
Currano J. N.
Cutting D. R.
Davis C. H.
Dieb T. M.
Dogan R. I.
Downs G. M.
Dunikowski L. G.
Embarek M.
Eom J.-H.
Faber J.
Fall C. J.
Fattore M.
Fennell R. W.
Freund Y.
Fujiyoshi A.
Fukuda K.
Gale W. A.
Garcelon N.
Garnier J.-P.
Garten Y.
Ginn R.
Giuliano C.
Gold S.
Grefenstette G.
Grishman R.
Gurulingappa H.
Gusfield D.
He Y.
Hearst M. A.
Hersh W.
Hirschman L.
Hobbs J. R.
Hodge G. M.
Holzinger A.
Hsueh P.-Y.
Huber T.
Iyer S. V
Jackson P.
Joachims T.
Johnson D.
Jonnalagadda S.
Julen Oyarzabal
Jurafsky D.
Kaewphan S.
Karkaletsis V.
Katragadda S.
Kazama J.
Kazawa H.
Kelly L.
Kenny P. W.
Kim J.-D.
Kim Y.
Kleene S. C.
Kolárik C.
Kongburan W.
Kornai A.
Kraaij W.
Krallinger M.
Kremer G.
Kreuzthaler M.
Kucera H.
Lai H.
Lawson A. J.
Leaman R.
Lee C.-H.
Levenshtein V. I.
Levin M. A.
Li J.
Li N.
Li Y.
Liu X.
Locke W. N.
Lovins J. B.
Lowe D. M.
Lupu M.
Mackenzie C. E.
Manning C. D.
Mansouri A.
Martin E.
Martin Krallinger
Mattmann C.
Maynard D.
McCallum A.
McEwen L.
McKnight L.
McNaught A.
Meystre S. M.
Michalski S. R.
Michie D.
Mihalcea R.
Mitton R.
Miwa M.
Mollá D.
Murray-Rust P.
Müller B.
Nebel A.
Nikfarjam A.
Névéol A.
Obdulia Rabal
Pang B.
Panico R.
Perez-Iratxeta C.
Ponomareva N.
Ratinov L.
Ratnaparkhi A.
Read J.
Rebholz-Schuhmann D.
Reeker L. H.
Rocchio J. J.
Rohbeck H.-G.
Rosario B.
Roth D. L.
Rupp C. J.
Sagae K.
Salim N.
Salton G.
Sanchez-Cisneros D.
Saracevic T.
Sasaki Y.
Schapire R. E.
Schenck R.
Schenck R. J.
Schlaf A.
Schuemie M. J.
Segura Bedmar I.
Segura-Bedmar I.
Sekine S.
Sequeira E.
Settles B.
Sewell W.
Shen D.
Shidha M. V
Singhal A.
Smith E. G.
Stamatatos E.
Sutton C.
Sætre R.
Taylor K. T.
Tharatipyakul A.
Tomanek K.
Tsuruoka Y.
Täger W.
Urbain J.
van Rijsbergen C. J.
Vapnik V. N.
Vasserman A.
Visweswaran S.
Voorhees E. M.
Wang W.
Wang Y.
Wei C.-H.
Wermter J.
Wilbur W. J.
Willett P.
Williams A. J.
Witten I. H.
Workman M. L.
Wrublewski D. T.
Xu R.
Xue N.
Yan S.
Yang C.
Yang C. C.
Yang Y.
Zass E.
Zipf G. K.
Zitnik S.
Publication venue: American Chemical Society (ACS)
Publication date: 01/01/2017

Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extraction of the entire list of chemicals mentioned in a document, including any associated information. In this Review, we provide a comprehensive and in-depth description of fundamental concepts, technical implementations, and current technologies for meeting these information demands. A strong focus is placed on community challenges addressing systems performance, more particularly CHEMDNER and CHEMDNER patents tasks of BioCreative IV and V, respectively. Considering the growing interest in the construction of automatically annotated chemical knowledge bases that integrate chemical information and biological data, cheminformatics approaches for mapping the extracted chemical names into chemical structures and their subsequent annotation together with text mining applications for linking chemistry with biological information are also presented. Finally, future trends and current challenges are highlighted as a roadmap proposal for research in this emerging field. A.V. and M.K. acknowledge funding from the European Community’s Horizon 2020 Program (project reference: 654021 - OpenMinted). M.K. additionally acknowledges the Encomienda MINETAD-CNIO as part of the Plan for the Advancement of Language Technology. O.R. and J.O. thank the Foundation for Applied Medical Research (FIMA), University of Navarra (Pamplona, Spain).
This work was partially funded by Consellería de Cultura, Educación e Ordenación Universitaria (Xunta de Galicia), and FEDER (European Union), and the Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of UID/BIO/04469/2013 unit and COMPETE 2020 (POCI-01-0145-FEDER-006684). We thank Iñigo García-Yoldi for useful feedback and discussions during the preparation of the manuscript.

Universidade do Minho: RepositoriUM

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

GA4GH: International policies and standards for data sharing across genomic research and healthcare.

Author: Adams Jeremy B
Alterovitz Gil
Auvil Jaime M Guidry
Babb Lawrence J
Barkley Maxmillian P
Baudis Michael
Beauvais Michael JS
Beck Tim
Beckmann Jacques S
Beltran Sergi
Bernick David
Bernier Alexander
Birney Ewan
Bonfield James K
Boughtwood Tiffany F
Bourque Guillaume
Bowers Sarion R
Brookes Anthony J
Brudno Michael
Brush Matthew H
Bujold David
Burdett Tony
Buske Orion J
Cabili Moran N
Cameron Daniel L
Carroll Robert J
Casas-Silva Esmeralda
Chakravarty Debyani
Chaudhari Bimal P
Chen Shu Hui
Cherry J Michael
Chung Justina
Cline Melissa
Clissold Hayley L
Cook-Deegan Robert M
Courtot Mélanie
Cunningham Fiona
Cupak Miro
Davies Robert M
Denisko Danielle
Doerr Megan J
Dolman Lena I
Dove Edward S
Dursi L Jonathan
Dyke Stephanie OM
Eddy James A
Eilbeck Karen
Ellrott Kyle P
Fairley Susan
Fakhro Khalid A
Firth Helen V
Fitzsimons Michael S
Fiume Marc
Flicek Paul
Fore Ian M
Freeberg Mallory A
Freimuth Robert R
Fromont Lauren A
Fuerth Jonathan
Gaff Clara L
Gan Weiniu
Ghanaim Elena M
Glazer David
Goodhand Peter
Green Robert C
Griffith Malachi
Griffith Obi L
Grossman Robert L
Groza Tudor
Guigó Roderic
Guimera Roman Valls
Gupta Dipayan
Haendel Melissa A
Hamosh Ada
Hansen David P
Hart Reece K
Hartley Dean Mitchell
Haussler David
Hendricks-Sturrup Rachele M
Ho Calvin WL
Hobb Ashley E
Hoffman Michael M
Hofmann Oliver M
Holub Petr
Hsu Jacob Shujui
Hubaux Jean-Pierre
Hunt Sarah E
Husami Ammar
Jacobsen Julius O
Jamuar Saumya S
Janes Elizabeth L
Jeanson Francis
Jené Aina
Johns Amber L
Joly Yann
Jones Steven JM
Kanitz Alexander
Kato Kazuto
Keane Thomas M
Kekesi-Lafrance Kristina
Kelleher Jerome
Kerry Giselle
Khor Seik-Soon
Knoppers Bartha M
Konopko Melissa A
Kosaki Kenjiro
Kuba Martin
Lawson Jonathan
Leinonen Rasko
Li Stephanie
Lin Michael F
Linden Mikael
Liu Xianglin
Lopez Javier
Lucassen Anneke M
Lukowski Michael
Mann Alice L
Marshall John
Mattioni Michele
Metke-Jimenez Alejandro
Middleton Anna
Milne Richard J
Molnár-Gábor Fruzsina
Mulder Nicola
Munoz-Torres Monica C
Nag Rishi
Nakagawa Hidewaki
Nasir Jamal
Navarro Arcadi
Nelson Tristan H
Niewielska Ania
Nisselle Amy
Niu Jeffrey
North Kathryn
Nyrönen Tommi H
O'Connor Brian D
Oesterle Sabine
Ogishima Soichi
Page Angela JH
Paglione Laura AD
Palumbo Emilio
Parkinson Helen E
Philippakis Anthony A
Pizarro Angel D
Prlic Andreas
Rambla Jordi
Rehm Heidi L
Rendon Augusto
Rider Renee A
Robinson Peter N
Rodarmer Kurt W
Rodriguez Laura Lyman
Rubin Alan F
Rueda Manuel
Rushton Gregory A
Ryan Rosalyn S
Saunders Gary I
Schuilenburg Helen
Schwede Torsten
Scollen Serena
Senf Alexander
Sheffield Nathan C
Skantharajah Neerjah
Smith Albert V
Smith Lindsay
Sofia Heidi J
Spalding Dylan
Spurdle Amanda B
Stark Zornitza
Stein Lincoln D
Suematsu Makoto
Tan Patrick
Tedds Jonathan A
Thomson Alastair A
Thorogood Adrian
Tickle Timothy L
Tokunaga Katsushi
Torrents David
Törnroos Juha
Udara Liyanage Isuru
Upchurch Sean
Valencia Alfonso
Vamathevan Jessica
Varma Susheel
Vears Danya F
Viner Coby
Voisin Craig
Wagner Alex H
Wallace Susan E
Walsh Brian P
Wang Vivian Ota
Williams Marc S
Winkler Eva C
Wold Barbara J
Wood Grant M
Woolley J Patrick
Yamasaki Chisato
Yates Andrew D
Yung Christina K
Zass Lyndon J
Zaytseva Ksenia
Zhang Junjun
Publication venue: Cell Genom
Publication date: 01/01/2021

The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution. We describe the GA4GH organization, which is fueled by the development efforts of eight Work Streams and informed by the needs of 24 Driver Projects and other key stakeholders. We present the GA4GH suite of secure, interoperable technical standards and policy frameworks and review the current status of standards, their relevance to key domains of research and clinical care, and future plans of GA4GH. Broad international participation in building, adopting, and deploying GA4GH standards and frameworks will catalyze an unprecedented effort in data sharing that will be critical to advancing genomic medicine and ensuring that all populations can access its benefits.

The Jackson Laboratory: The Mouseion at the JAXlibrary

edoc

University of Northampton's Research Explorer

PubMed Central

Edinburgh Research Explorer